A Survey on Arabic Named Entity Recognition: Past, Recent Advances, and Future Trends
نویسندگان
چکیده
As more and Arabic texts emerged on the Internet, extracting important information from these is especially useful. a fundamental technology, Named entity recognition (NER) serves as core component in extraction while also playing critical role many other Natural Language Processing (NLP) systems, such question answering knowledge graph building. In this paper, we provide comprehensive review of development NER, recent advances deep learning pre-trained language model. Specifically, first introduce background including characteristics existing resources for NER. Then, systematically NER methods. Traditional systems focus feature engineering designing domain-specific rules. years, methods achieve significant progress by representing via continuous vector representations. With growth model, yields better performance. Finally, conclude method gap between languages, which helps outline future directions
منابع مشابه
Arabic Named Entity Recognition
Stemming is the process of reducing words to their stems or roots. Due to the morphological richness and complexity of the Arabic language, stemming is an essential part of most Natural Language Processing (NLP) tasks for this language. In this paper, we study the impact of different stemming approaches on the Named Entity Recognition (NER) task for Arabic and explore the merits, limitations an...
متن کاملA Survey of Arabic Named Entity Recognition and Classification
As more and more Arabic textual information becomes available through the Web in homes and businesses, via Internet and Intranet services, there is an urgent need for technologies and tools to process the relevant information. Named Entity Recognition (NER) is an Information Extraction task that has become an integral part of many other Natural Language Processing (NLP) tasks, such as Machine T...
متن کاملFine-grained Arabic named entity recognition
Named Entity Recognition (NER) is a Natural Language Processing (NLP) task, which aims to extract useful information from unstructured textual data by detecting and classifying Named Entity (NE) phrases into predefined semantic classes. This thesis addresses the problem of fine-grained NER for Arabic, which poses unique linguistic challenges to NER; such as the absence of capitalisation and sho...
متن کاملNamed Entity Recognition for Arabic Social Media
The majority of research on Arabic Named Entity Recognition (NER) addresses the the task for newswire genre, where the language used is Modern Standard Arabic (MSA), however, the need to study this task in social media is becoming more vital. Social media is characterized by the use of both MSA and Dialectal Arabic (DA), with often code switching between the two language varieties. Despite some...
متن کاملNamed Entity Recognition for Dialectal Arabic
To date, majority of research for Arabic Named Entity Recognition (NER) addresses the task for Modern Standard Arabic (MSA) and mainly focuses on the newswire genre. Despite some common characteristics between MSA and Dialectal Arabic (DA), the significant differences between the two language varieties hinder such MSA specific systems from solving NER for Dialectal Arabic. In this paper, we pre...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Knowledge and Data Engineering
سال: 2023
ISSN: ['1558-2191', '1041-4347', '2326-3865']
DOI: https://doi.org/10.1109/tkde.2023.3303136